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A successful state transfer (or teleportation) experiment must perform better than the benchmark 
set by the 'best' measure and prepare procedure. We consider the benchmark problem for the follow- 
ing families of states: (i) displaced thermal equilibrium states of given temperature; (ii) independent 
identically prepared qubits with completely unknown state. For the first family we show that the 
optimal procedure is heterodyne measurement followed by the preparation of a coherent state. This 
procedure was known to be optimal for coherent states and for squeezed states with the 'overlap 
fidelity' as figure of merit. Here we prove its optimality with respect to the trace norm distance and 
supremum risk. For the second problem we consider n i.i.d. spin-| systems in an arbitrary unknown 
state p and look for the measurement-preparation pair (M n ,P n ) for which the reconstructed state 
Un := P n o M n (p® n ) is as close as possible to the input state, i.e. \\cj n — p® n \\i is small. The 
figure of merit is based on the trace norm distance between input and output states. We show that 
asymptotically with n the this problem is equivalent to the first one. The proof and construction of 
(M n ,P n ) uses the theory of local asymptotic normality developed for state estimation which shows 
that i.i.d. quantum models can be approximated in a strong sense by quantum Gaussian mod- 
els. The measurement part is identical with 'optimal estimation', showing that 'benchmarking' and 
estimation are closely related problems in the asymptotic set-up. 

PACS numbers: 03.67.Hk, 03.65.Wj, 02.50.Tt 



I. INTRODUCTION 

Quantum teleportation [1 and quantum state storage 
[2] are by now well-established protocols in quantum in- 
formation science. In both cases the procedure amounts 
to mapping one quantum state onto another (at a re- 
mote location in the case of teleportation), by making 
use of quantum correlations in the form of entanglement 
or interaction between systems. However, approximate 
transformations could also be accomplished without any 
use of quantum correlations, by means of classical 'mea- 
sure and prepare' (MAP) schemes. Whilst in the ideal 
case, the entanglement resource gives quantum telepor- 
tation a clear advantage in terms of performance, there 
exists inevitable degradation of the quantum channel in 
realistic implementations. This has led to a number of 
investigations into the existence of optimal MAP schemes 
to locate classical-quantum boundaries and assign precise 
benchmarks for proving the presence of quantum effects 
[3]. Any experimental demonstration of quantum tele- 
portation and state storage has to perform better than 
the optimal MAP scheme, to be certified as a truly quan- 
tum demonstration. A review of the quantum bench- 
marks for completely unknown pure input states of d- 
dimensional systems can be found in [4 . More recent 
research has largely focused on benchmarks originating 
in the context of teleportation and quantum memory for 
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continuous variable (CV) systems [5], with notable re- 
sults obtained for transmission of pure and mixed coher- 
ent input states, and squeezed states [6UTU]. Beautiful ex- 
periments [TTHT4] involving light (Gaussian modes) and 
matter (coherent and spin-squeezed atomic ensembles) 
have demonstrated unambiguous quantum teleportation, 
storage and retrieval of these infinite-dimensional quan- 
tum states with a measured 'fidelity' between input and 
output exceeding the benchmark set by the optimal MAP 
strategy (see also [15j [16] ) . 

In each of the above cases, the benchmarks deal with 
the case of teleportation or storage of single input states 
drawn from a set, in a Bayesian or pointwise set-up. To 
date, there exist no nontrivial benchmarks for the trans- 
mission of multiple copies of quantum states - a 'quan- 
tum register' - in particular for an ensemble of n inde- 
pendent and identically-distributed (i.i.d.) qubits. Such 
a task comes as a primitive in distributed quantum com- 
munication. Quantum registers can be locally initialised 
and then transferred to remote processing units where a 
quantum computation is going to take place. Also, in 
hybrid interfaces between light and matter [TT] . storage 
and retrieval of e.g. coherent states, involves mapping 
the state of n i.i.d. atoms onto a light mode (back and 
forth). Therefore, strictly speaking, a quantum bench- 
mark for this precise input ensemble would be needed 
to assess the success of the experiment. In the current 
practice [12] [13] the problem is circumvented by noting 
that the collective spin components of the atomic ensem- 
ble (with n ~ 10 12 [18] ) approximately satisfy canonical 
commutation relations, henceforth the atomic system is 
treated a priori as a CV system, and the corresponding 
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benchmarks are used. 

In this paper we put this procedure on firm grounds, by 
proving rigorously that the optimal MAP scheme for tele- 
portation and storage of n i.i.d. unknown mixed qubits 
converges when n — >• oo to the optimal MAP scheme 
for a single-mode displaced thermal state. Additionally, 
we also prove that the heterodyne measurement followed 
by the preparation of a coherent state is optimal MAP 
scheme for displaced thermal states when the figure of 
merit is the trace norm distance. The same scheme is 
known to be optimal for thermal and squeezed states, 
but for a figure of merit based on overlap fidelity [9]. 

The key tool in deriving our results is the theory of 
local asymptotic normality (LAN) for quantum states 
[T9H22] which is the quantum extension of a fundamen- 
tal concept in mathematical statistics introduced by Le 
Cam [23] . In the classical context this roughly means 
that a large i.i.d. sample Xi, . . . , X n from an unknown 
distribution contains approximately the same amount of 
statistical information as a single sample from a Gaussian 
(normal) distribution with unknown mean and known 
variance. 

In the quantum case, LAN means that the joint 
(mixed) state pf n of n identically prepared (finite dimen- 
sional) quantum systems can be transferred by means of a 
quantum channel to a quantum-classical Gaussian state, 
with asymptotically vanishing loss of statistical informa- 
tion. More precisely, for any fixed 0o there exist quantum 
channels T n , S n such that 
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The paper is organised as follows. In Section [TT| we 
give a precise statistical formulation of the benchmark 
problem, and we explain in some detail the definition of 
the asymptotic risk. A brief overview of the necessary 
classical and quantum concepts from the LAN theory is 
given in Section |III| In Section |IV| we then revisit the 
benchmark problem for displaced thermal states. When 
the figure of merit is the overlap fidelity, the solution was 
found in [9] to be the heterodyne detection followed by 
the repreparation of a coherent state. We solve the same 
problem using the trace norm loss function, and again 
find this MAP scheme to be optimal. Interestingly, the 
optimality proof is based on the concept of stochastic 
ordering which was previously used for finding optimal 
cloning maps for thermal states [25]. This result allows 
us to exploit LAN and solve the benchmark problem for n 
i.i.d. qubits in the asymptotic limit n — » oo. A construc- 
tive solution along with a proof of optimality is given in 
Section [V] We conclude with discussions in Section IVII 
The Appendix contains additional mathematical details 
on the LAN theory for qubits. 
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II. STATISTICAL FORMULATION OF THE 
BENCHMARK PROBLEM 

The general statistical formulation of quantum bench- 
marking is as follows. Let 



Q:= {pe'.Oe 6} 



converge to zero as n —> oo, uniformly over a n _1 / 2+e 
local neighbourhood of the state pe . Here N u is a clas- 
sical normal distribution and $> u is a Gaussian state on 
an ensemble of oscillators whose means are linear trans- 
formations of u and the covariance matrices depend only 
on 0q. The qubit case is described in detail in Section |Hl| 
and the precise result is formulated in Theorem |III.4| 

The LAN theory has been used to find asymptotically 
optimal estimation procedures for qubits [20] and qudits 
[2T] and to show that the Holevo bound for state esti- 
mation is achievable [24 . Here we use it to solve the 
benchmark problem for qubits by casting it into the cor- 
responding one for displaced thermal states. The follow- 
ing diagram illustrates the asymptotically optimal MAP 
scheme: the measurement M n consists in composing the 
channel T n with the heterodyne measurement H. The 
preparation procedure consists in creating the coherent 
state \ctu) and mapping it back to the qubits space by 
the channel S n . The optimality of the scheme is proved 



be a quantum model, i.e. a family of quantum states 
on a Hilbert space H, indexed by a parameter G 6. 
In this paper G is always an open subset of R fc , i.e. we 
are in a parametric set-up. The model encodes our prior 
information about the state and the parameter is con- 
sidered to be unknown. We are given a quantum system 
prepared in the state pe and we would like to find the 
MAP (or classical) channel T : Ti(H) -> Ti(U) for which 
T(pe) is 'close' to pe. A MAP channel is the composition 
T = PoM of a measurement M with outcomes in a mea- 
sure space (Af,E), and a repreparation P which assigns 
to every result x G X a state p x G Ti(H). There are sev- 
eral natural distance functions on the state space, such 
as the Bures distance [26 with its associated Uhlmann 
fidelity [27] , or the trace- norm distance considered in this 
paper. Since is unknown we choose the maximum risk 
as the overall figure of merit of a scheme L := (M, P) 

RmM := sup \\T(p e ) - p e \\u T = P o M. (2) 
eeo 
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Alternatively one may use a Bayesian risk where the per- 
formance at different parameters is weighted by a prior 
distribution over 0. The goal is to find a MAP scheme 
with the lowest risk R max (L), which will be called a min- 
max scheme. 

The above formulation is particularly suitable in the 
case of covariant models such as the displaced thermal 



state treated in Section IV or the single qubit case of 
[TO] . Let us consider now the scenario where we want to 
teleport n identically prepared systems, such as in the 
case of atomic clouds. The relevant model is then 

Q n := {pf n :6&e}, 

and we would like to find the optimal MAP scheme L n := 
(M n ,P n ) for a given n. The experience accumulated in 
the related domain of state estimation indicates that this 
problem can rarely be solved explicitly [28] , but becomes 
tractable in an asymptotic framework [20j [21] . We adopt 
this set-up in our benchmark problem and define the local 
risk of a procedure L n := (M n , P n ), around Oo G by 

i?max(£n;0o) := SU P \\T n ( P f n ) - pf U || 1 . 

\\pe-pe \\i<n-^^ 

where unlike formula ([2| we take the maximum over a 
neighbourhood of Oo of size n - 1 / 2 + e with < e <C 1 
arbitrary. 

The asymptotic local risk of the sequence {L n := 
(M n ,P n ) : n G N} is defined by 

R({L n : n G N};0 ) := limsup i? max (L n ; 6> ), (3) 

n— )>oo 

and by definition a MAP sequence {L n : n G N} is opti- 
mal if it achieves the lowest possible asymptotic local risk 
at any point Oo £ ©• The latter is called the asymptotic 
minmax risk 



R 



minmax 



lim sup inf # max (L n ; O ) . 



Since the above formula may look rather ad-hoc to the 
reader who is not familiar with statistical methodology, 
we would like to explain its meaning in some detail. First 
of all, note that we compare the input and output states 
globally rather than locally on each system. This means 
that even though as n — >• oo we get more and more in- 
formation about the parameter and we can estimate 
it with accuracy 0(n -1 / 2 ), in the same time the task of 
repreparing the state pf n becomes more and more diffi- 
cult! This can be easily understood by looking at fidelity 
in the case of pure states. Let n be an estimator of 
obtained by measuring ipf n , so that n — — 0(n -1 / 2 ), 
and suppose that we reprepare the state . Then 



(tff n |Vf n n }f = (cosa„) 2 " = 1 - al/2 + oin- 1 ) 

where a n is an angle of order n -1 / 2 . Since 

/ c \ 2n 

[1 - - + oipT 1 )} —> exp(-2c), 



we see that the input-output fidelity cannot converge to 
1. We will show that this is the case for arbitrary states, 
and also when we allow for other MAP schemes. 

The second remark concerns the supremum over the 
small ball \\pe ~ Pe\\i < n~ 1 / 2+e in definition Why 
not consider the supremum over all as we did in the non- 
asymptotic case ? The reason is that the global supre- 
mum would be overly pessimistic and would be domi- 
nated by the region in the parameter space which is hard- 
est for the benchmark problem. Restricting to a ball 
whose size is roughly that of the uncertainty in the pa- 
rameter captures the local behaviour of MAP scheme at 
each point and is more informative than the global max- 
imum. The ball should have size n - 1 / 2 + e because even if 
9 is unknown beforehand, it can be localised within such 
a region by measuring a small proportion n 1_e <C n of 
the systems, so that effectively we know that we are in 
the local ball, and this should be reflected in the defini- 
tion of the risk. The localisation argument is standard 
in statistics and its application in quantum statistics is 
detailed in [20] . 

Finally, the relation between our figure of merit and 
the Bayes risk can be sketched as follows. If R n denotes 
the asymptotically optimal Bayes risk for the prior 7r, 



<g>n I 



R* := lim sup inf f ir{d6)\\T n {pf n ) - pf 



then under suitable conditions on tt and the model Q one 
obtains 



R n = J 7r(d0)R 



minmax 



(61). 



The intuitive explanation is that when n — >• oo the fea- 
tures of the prior tt are washed out and the posterior 
distribution concentrates in a local neighbourhood of the 
true parameter, where the behaviour of the MAP proce- 
dures is governed by the local minmax risk. A full proof 
of this relation is beyond the scope of this paper (and will 
be presented elsewhere) but the interested reader may 
consult [29] for the proof of the corresponding statement 
in the case of state estimation. 



III. LOCAL ASYMPTOTIC NORMALITY 

In this Section we will give a brief, self-contained in- 
troduction to the theory of LAN in as much detail as it 
is necessary for this paper and we refer to [20] for proofs 
and more analysis. 

LAN is a fundamental concept in mathematical statis- 
tics introduced by the French statistician Le Cam [23] . 
It roughly means that a large i.i.d. sample X±, . . . ,X n 
from an unknown distribution contains approximately 
the same amount of statistical information as a single 
sample from a Gaussian distribution with unknown mean 
and known variance. More precisely if Xi has distribution 
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P<9 depending 'smoothly' on a finite dimensional param- 
eter G 6 C R k , then in an 1//2 -size neighbourhood of 
any point 9q, the statistical model 

V n := {vi +u/ ^:u&R k } 

is well approximated by a simpler Gaussian shift model 

AT := {N (i£,/ _1 (0 o )) :nGi fe } 

where u G M fc is the local unknown parameter of the dis- 
tribution, and / -1 (#o) is the inverse Fisher information 
matrix at Oq • Note that this approximation holds only lo- 
cally, reflecting the intrinsic uncertainty in the unknown 
parameter for the sample size n, which should not be 
seen as an additional assumption about 6. Indeed, the 
parameter can always be localised in such a region us- 
ing a preliminary estimator, with vanishing probability 
of failure. The point of this approximation is to reduce 
a statistical problems (e.g. estimation) about the more 
complex model V n to a simpler problem about the Gaus- 
sian model M. 

Local asymptotic normality also provides a convenient 
description of quantum statistical models involving i.i.d. 
quantum systems. Here the idea is: when the quantum 
'sample' is large, the model can be approximated by a 
simpler quantum Gaussian model. If this approximation 
holds in a sufficiently strong sense, then statistical prob- 
lems about the qubits model can be reformulated in terms 
of the Gaussian one, without any loss of optimality. 

In quantum statistics this technique has been used for 
optimal state estimation with completely unknown finite 
dimensional quantum states [T9l [21] , for optimal classi- 
fication (learning) of spin states [31], for state transfer 
between matter and light [20 j. In the physics literature, 
LAN is used in an informal way to describe the dynamics 
of atomic gases in a simplified Gaussian approximation, 
in quantum memories and quantum metrology with spin 
coherent and squeezed states. 



A. LAN for qubit systems 

We are given n independent identically prepared spin- 
\ particles (qubits) in a state 

Pr= 2( 1+rCr ) 

where r is the Bloch vector of the state and 3 = 
{px,Vw>Vz) are the Pauli matrices in M(C 2 ). 

Although a priori the state is completely unknown, 
the following argument shows that that without loss of 
generality we can assume it to be localised within a small 
ball of size 77 / _1 / 2 + e where e > is arbitrarily small. In- 
deed by measuring a small proportion n 1_e <C n of the 
systems we can devise an initial rough estimator po : = Pf 
so that with high probability the state is in a ball of size 



n 1 / 2+e around po (see Lemma 2.1 in [19]). We label the 
states in this ball by the local parameter u 

Pu/^i = \ (1 + (r + u/Vn)a) 
and define the local statistical model by 

Q„:=K:NI<^}, ^:=P?/V (4) 




By choosing the reference frame with its z axis along 
fo we find that up to 0(n _1 ) terms the state p$/^ is 
obtained by perturbing the eigenvalues of po and rotating 
it with a 'small unitary' 



u/ y/n 1 



where 

Uti/^i '•= exp(i(-^cri + u x a 2 )/2roVn), r := ||f ||. 

B. The big Bloch ball picture 

The asymptotic behaviour of the multiple spins state 
can be intuitively explained through the 'big Bloch 
sphere' picture commonly used to describe spin coher- 
ent [32] and spin squeezed states [33 . Let 



La :=£^ 



x,y,z 



be the collective spin components along the reference 
frame directions. By the Central Limit Theorem, the 
distributions of L a with respect to pf n converge as 



1 



(L z - nr ) 



V 



V 



N(0,l-r 2 ), 
N(0,1), 



so that the joint spins state can be pictured as a vector 
of length nro whose tip has a Gaussian blob of size y/n 
representing the uncertainty in the collective variables 
(see Figure [TJ. Further more, by a law of large numbers 
argument we evaluate the commutators 



-L z 



- 2i — L z « 2zrol, 

n 

0. 



Thus the rescaled observables L x / V^r^n and L y / V^r^n 
converge to the canonical coordinates Q and P of a quan- 
tum harmonic oscillator. Moreover, the variances corre- 
spond to that of a thermal equilibrium state 



$:=(l-s)J2s k \k)(k\, 



k=0 
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Figure 1: (Color online) Big Bloch ball picture for n-qubit 
i.i.d. mixed states. 



where {\k) : k > 0} represents the Fock basis. 

As for the third component, (L z — nro)/y/n converges 
to a classical Gaussian variable X ~ N := 7V(0, 1 — r%) 
which is independent of the quantum state. 

How does the limit change when we perturb the state 
of the spins ? By the same argument we find that the 
variables Q,P,X pick up expectations which (in the first 
order in n -1 / 2 ) are proportional to the local parame- 
ters (u x ,u y ,u z ) while the variances remain unchanged. 
More precisely the oscillator is in a displaced thermal 
equilibrium state ^ : = D{u)QD(u)\ where D(u) is the 
displacement operator 

D(u) := exp (i(-u y Q + u x P)/y/2r^) , 

and the classical part has distribution N$ := N(u z , 1 — 
To 2 )- 

We have thus identified the limit Gaussian model 
which is a tensor product of a classical distribution and 
a quantum state, which together can be seen as a state 
(positive normal functional) on the von Neumann algebra 
B(£ 2 (N)) ®L°°(R). 

Definition III.l. The quantum Gaussian shift model Q 
is defined by the family of quantum- classical states 

:= ® JVtf :uGR 3 } (5) 

on B(£ 2 (N))®L°°(R). 

In the next subsection we formulate a precise state- 
ment about the convergence to the Gaussian model which 
goes beyond the Central Limit type argument presented 
above. 



C. Strong convergence to Gaussian shift model 

The notion of strong convergence of classical statistical 
models was introduced by Le Cam and is based on defin- 
ing a natural distance between statistical models with 
the same parameter space, so that models models at zero 
distance are statistically equivalent and models which are 
close, have similar behaviour for 'regular' statistical de- 
cision problems. The existing results on quantum suffi- 
ciency [34] and quantum LAN [2TJ [22] indicate the exis- 
tence of a theory of quantum decision and convergence 
of models. 

In the classical set-up the distance between models is 
defined operationally, in terms of randomisations which 
can be seen as the classical counterpart of quantum chan- 
nels. 

Definition III. 2. A positive linear map 

is called a stochastic operator (or randomisation) if 
TO ||i = IHIi for every p&L\{X). 

Since we work with models which may contain both 
classical and quantum 'states' we will call a channel, a 
completely positive, normalised map between preduals of 
von Neumann algebras. In finite dimensions, this reduces 
to the familiar notion of a channel, this time with block- 
diagonal input and output density matrices. 

Definition III.3. Let V := {p e : G 9} and Q := {a e : 

6 G 0} be two quantum statistical models over 6 with po 
and gq normal states of von Neumann algebras A and 
respectively B. 

The deficiencies 5(V, Q) and 5(Q,V) are defined as 

S(P,Q) := mfsup||T(^)-^||i 
T eee 

5(Q,P) := infsup||S(<7*)-p*||i 

s eee 

where the infimum is taken over all channels T, S and 
|| • ||i denotes the L\-norm on the preduals. 
The Le Cam distance between V and Q is 

A(7>,e) :=max(£(Q,n S(P, Q)). 

With this definition we can formulate the strong con- 
vergence of the sequence Q n of i.i.d. qubit models to the 
Gaussian limit. 

Theorem III. 4. Let Q n be the sequence of statistical 
models Q for n i.i.d. local spin-\ states, and let Q n 
be the restriction of the Gaussian shift model ^ to the 
range of parameters \\u\\ < n e . Then 

lim A(Q n ,G n ) =0, 

n— )-oo 

i.e. there exist sequences of channels T n and S n such that 
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lim sup ||*a®JV a -T n (p3)||i = 0, 
lim sup \\p%- S n ($ ii ®N ii )\\ 1 =0. 

™^°° ||u||<n e 



(6) 



Let us make a few comments on the significance of 
the above result. The first point is that LAN provides 
a stronger characterisation of the 'Gaussian approxima- 
tion' than the usual Central Limit Theorem arguments. 
Indeed the convergence in Theorem|III.4 is strong (in Li) 



rather than weak (in distribution), it is uniform over a 
range of local parameters rather than at a single point, 
and has an operational meaning based on quantum chan- 
nels. 

Secondly, one can exploit these features to devise 
asymptotically optimal measurement strategies for state 
estimation which can be implemented in practice by cou- 
pling with a bosonic bath and performing continuous 
time measurements in the bath [20] . 

Thirdly, the result is not restricted to state estimation 
but can be applied to a range of quantum statistical prob- 
lems involving i.i.d. qubit states such as cloning, telepor- 
tation benchmarks, quantum learning, and can serve as 
a mathematical framework for analysing quantum state 
transfer protocols. 

For completeness, we give a brief review of the main 
ideas involved in the proof of Theorem |III.4| and the de- 
scription of channels T n , S n in the Appendix. 



IV. QUANTUM BENCHMARK FOR 
DISPLACED THERMAL STATES WITH TRACE 
NORM DISTANCE 



In this Section we address the problem of finding the 
best MAP scheme for the Gaussian family of displaced 
thermal states with unknown mean and given variance 

T:={$ z :ze£}. 

To our knowledge this problem has only been solved in 
the case when the figure of merit is 'overlap fidelity' [9]. 
Here we show that the same procedure is optimal when 
the loss function is the trace norm distance. 

Let L := (M, P) be a MAP procedure and define the 
maximum risk as 

RmM = sup ||P o M{$ z ) - ^||i. 

zee 

The main result of this Section is the following. 

Theorem IV. 1. Let L* := (H,P) be given by hetero- 
dyne measurement followed by preparation of a coherent 
state centred at the outcome of the measurement. Then 
L* is minmax i.e. for any L 

-Rmax(-^*) < -Rmax(-^) 5 



Before proceeding with the proof let us recall some 
basic definitions. The quantum particle or 'one mode' 
continuous variables system is characterised by the Weyl 
(or CCR) algebra generated by the operators W% with 
£ £ C satisfying the commutation relations 

W^W C = W c W^exp(-am(£,C». 

The algebra is represented on the Hilbert space L 2 (R) as 
W% := exp(£a^ — £a) with a, the creation and annihi- 
lation operators. The latter are defined by their action 
on the Fock basis 



|fc) :=H k (x)e- x2/2 /^/k\2 k V^ fc = 0,1,2... 

such that a)\k) = Vk + l\k + 1) and a\k) = Vk\k - 1). 
A thermal state is a mixed state defined as 

CO 

$:=^(l- S ) S fe |fc)(fc|, 

where < s < 1 is a parameter related to the temper- 
ature by s = e _/3 , which will be considered fixed and 
known. The displaced thermal states are & z := W Z &W^. 
The heterodyne (or coherent) measurement is defined by 
itsPOVM 

H(d() := KXCK/27T 

where \() := W^\0) are the coherent states. For any state 
p the probability density of the heterodyne outcomes is 
the Q-function [30] P P (Q = (CHC)/2tt. Now if we het- 
erodyne the state $> z we obtain ( ~ N(z, 1(1 — s) -1 ) and 
by preparing the coherent state \() we get the average 
output state 

T($ z ) :=Foff($ z )=$ z , 

where $ is the thermal state with s = (2 — s) -1 . 

Lemma IV. 2. The risk of the measure and prepare 
strategy L* := (H,P) is 

iW^*) := supllPofr^-^ll! 

zee 

= R*(s) :=2(2-5)- m °- 1 -2s mo+1 , 
where mo is the integer part of — log (2 — s)/ log s(2 — s) . 

Proof. By covariance we have i? m ax(£*) = \\^ — 
Both states are diagonal and we denote their elements 
qi := (1 — s)s l and pi := (1 — s)s z so that 



For such geometric distributions there exists an integer 
mo such that p\ < q\ for m < mo and p\ > q\ for m > mo , 
more precisely 



and its risk R*(s) is given in Lemma IV. 2 



m = L- log(2 - s)/ log s(2 - s)\ . 
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Figure 2: (Color online) Risk of the optimal measure and 
prepare scheme as function of s — e _/3 . All the quantities 
plotted are dimensionless. 



In conclusion 

\\ p - q \\ 1 = 2j2<Ii-Pi = 2^ mo+1 - 2s mo+1 . 



□ 



Figure |2] shows the decay of the risk R(L*) as a function 
of the parameter s = of the input state <£. 



Proof. We start the proof of Theorem IV. 1| by following 
a standard argument [9] which shows that we can first 
restrict to phase space (or displacement) covariant, en- 
tanglement breaking channels, and then that it is enough 
to show that L* is optimal in a larger class of 'time- 
reversible' channels which can be easily characterised by 
their action on the Weyl operators Wf := exp(^a"*" — fa): 



(7) 



where / is a quantum characteristic function /(£) = 
Tr (tWz) for some state r with positive Wigner function, 
i.e. / is also a classical characteristic function. 

For such channels T = P o M, the risk is independent 
of z and is equal to 

i2max(£) = ||T($)-$||i. 

Now, since <1> is invariant under phase rotations, <E> = 
exp(i#iV)<I>exp(— iON), we can apply the covariance ar- 
gument again [25] to conclude that we can restrict to 
channels which are covariant under phase rotation, which 
amounts to taking r to be diagonal in the Fock basis. 
Since for diagonal states /(£) = we obtain the fol- 

lowing Schrodinger version of 

Tr(T($)^) = Tr(rW^)Tr($W 5 ) 
= Tr(rW^)Tr($W_ ? -) 
= Tr (r <g> <£exp(£c - £c t )) 



c : 



:= y/2b + o f 



(8) 



is the annihilation operator of a new mode. In other 
words, the output state is the same as the state of the 
'amplified' mode c when b and a are prepared in states r 
and respectively From this point we follow closely the 
arguments used in [25] which were originally devised for 
finding the optimal amplification channel for displaced 
thermal states. The candidate channels are labelled by 
diagonal matrices r and we denote by p T the probability 
distribution consisting of the elements of the output state 
pi = T T (<f>)i^. We further denote q { := &a = (1 - s)s i 
the geometric distribution of the input thermal state and 
by p 00 the distribution of the output state corresponding 
to uo = |0)(0|. It is easy to verify that T u is the channel 
associated to L* := (if, P), hence we would like to show 
that for any r 

l|p T -9l|i>ll^-9l|i- 

The proof is split into two parts and relies on the notion 
of stochastic ordering as the key ingredient. 

Definition IV. 3. Let p = {p t : / e N} and q = {q t : I e 

N} be two probability distributions over N. We say that 
p is stochastically smaller than q (p -< q) if 



J2pi>J2qI: Vm>0. 



J=0 



Lemma IV. 4. For any state r the following stochastic 
ordering holds: 

p" di f- 

Proof The first step is to 'purify' $ by writing the mode 
a as one of the outputs of a degenerate parametric am- 
plifier [35] 



a = cosh(t)ai + sinh(t)a2 



with ai ? 2 in the vacuum. If tanh(t) 2 
of a is Plugging into (l8|) we get 



s then the state 



c = sinh(t)a{ + cosh(t) (Ta 2 + Rb) 

where sinh(t) = cosh(t),T = sinh(t)/ cosh(f) and R = 
v / 2/cosh(t) with T 2 + R 2 = 1. Physically this means 
that the modes a 2 and b are mixed with a beamsplitter 
and then a different degenerate parametric amplifier is 
applied together with mode a\ which is in the vacuum. 
Note that b is in the vacuum state if and only if b := 
Td2 + Rb is also in the vacuum. For general diagonal 
states r the mode b is in the state f given by the binomial 
formula [30] 



r)T 2{p - k) R 2k \p)(p\ = 5>ip><p|. 
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The result of the purification argument is that we have 
reduced the problem of proving stochastic ordering for 
states of c = + V2b when a is in a thermal state, to 
the analogue problem for c = smh(t)cfi + cosh(t)b with 
a := di in the vacuum. Since stochastic ordering is pre- 
served under convex combinations, it suffices to prove the 
statement when f = \k)(k\ for any k ^ 0. 

The following formula [35] gives a computable expres- 
sion of the output two-modes vector state of the amplifier 



e rat6t e -g(ata+6t6 + i) e _ra6| 0)A;) 



OO / 

1=0 ^ 



l + k 
k 



1/2 



|Z,Z + fc), 



where T = tanh(t) and = cosh(t). By tracing over the 
mode a we obtain the desired state of c 



2( '-%_^iio</|. 



The relation p^ ^ p r reduces to showing that 



1=0 



Z=0 



for all m. If m < fc the right side is equal to zero and the 
inequality is trivial. With the notation 7 = T 2 we get 



p+k 



£d? = (i- 7 ) fe+1 E- 



£=0 



1=0 



l + k 



(1 - 7) /c+1 (\ - 7 /c +p+ 1 



fc! 



v p+i 



1-7 



(k) 



< l_ 7 P+l^(l_ 7 )r 7 fc-r/'' 
r=0 ^ 

= i-7^ +1 = E^<E^ 



-p+ 1 

r 



1=0 



1=0 



□ 



Stochastic ordering can be transformed into the desired 
optimality result by a standard argument [25] which has 
some interest in its own and is summarised in the follow- 
ing lemma. The key property nee ded he re is that pf < q\ 
if and only if I < mo (see Lemma IV.2). 

Lemma IV. 5. The following inequality holds for any r 



l|p T -9l|i> 11^-9111- 



Proof. Define 



m 



a := max(m : ^ q t < a) 
1=0 



and V(a,r) = {D C N : E /^p r * < a }, for a11 a > °- 
Note that by Lemma IV. 4 we have ^J=o^ r — a f° r a ^ 
r, and thus {0, 1, ... , m a \ G V(a, r). Using the relation 
lb — 9 111 = 2sup D ^2 teD (pi — qi) we obtain the chain of 
inequalities 



lk-Pli 



2 sup sup ^2(qi-p[) 

a>0 DeT>(a,r) leD 



> 2 sup Eta >2snpJ2^i-Pi) 

a>0 ~~ a>0 ~~ 



1=0 



1=0 



2 sup ^2(qi 



m>0 



Pi 



1=0 



The first equality follows directly form the definition of 
P(a,r). The subsequent inequality restricts the supre- 
mum over all D G T>{a, r) to one element {0, 1, ... , m a }. 
In the second inequality we replace the distribution p T by 
p u using the stochastic ordering proved in Lemma |IV.4| 
In the subsequent equality we used the fact that both 
distributions p and (f are geometric. 

□ 



We have completed the proof of Theorem IV. 1[ show- 
ing that, also with respect to the trace norm figure of 
merit, a heterodyne measurement followed by the prepa- 
ration of a coherent state centred at the measurement 
outcome yields the optimal MAP scheme for the trans- 
mission of displaced thermal states. □ 



V. ASYMPTOTIC QUANTUM BENCHMARK 
FOR INDEPENDENT QUBITS 

Here we tackle the benchmark problem for n i.i.d. 
qubits in the asymptotic limit n -+ 00, we reformulate it 
in terms of the local coordinates according to the recipe 
described in Section [H| and we obtain a constructive so- 
lution to it, by proving that this benchmark problem is 
equivalent to the one for displaced thermal states dis- 
cussed and solved in Section |IV| The rigorous link be- 
tween the two settings is provided by the LAN theory, 
whose main concepts are reviewed in Section |III[ and 
which will constitute an essential mathematical tool for 
the findings of this Section. 

Given n i.i.d. spin-^ particles in state p? n , we per- 
form a measurement M n with outcome X n ~ P^ 71 in a 
measure space (Af n ,E n ) and reprepare a quantum state 
u(X n ) on (C 2 ) 0n . The MAP channel is L n := P n o M n 
and only such channels will be considered in our op- 
timisation. A full MAP protocol will be denoted by 
L:={L n :ne N}. 

The risk (figure of merit) of the protocol at r for a 
given n is 



R(f,L n ) 



\pf n -E(u(X n 



(9) 



Pf 



X 
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As a measure of the overall performance of L n we con- 
sider the local maximum risk a neighbourhood of fo 



sup i?(r, 

||r-r ||<n- 1 / 2 + e 



whose asymptotic behaviour is 

R(ro,L) := limsupi? n 



(10) 



(11) 



Definition V.l. The local minmax risk at fo is defined 
by 



R 



minmax 



(fo) := limsup inf R n 

n— foo L n 



t(r ,L n )- 



A protocol L is called locally mimimax at fo if R(ro,L) 

-^minmax (^0 ) • 



In Theorem |V.2| we show that the following sequence 
of MAP maps is locally minmax. 

Adaptive measure and prepare protocol: 

1. The state is first localised in a neighbourhood ||f — 
fb|| < n _1 / 2+e of fo by u sing a small proportion of 
the systems (see Section III). 



2. The remaining spins are mapped by the channel 
T n close to the classical-quantum Gaussian state 
$u ® as in Theorem |III.4[ 

3. A heterodyne measurement together with an obser- 
vation of the classical component give an estimator 
u of u. 

4. A coherent state \a%) is prepared with the mean 
equal to the outcome of the heterodyne measure- 
ment. The inverse channel S n is applied to the 
coherent state and the classical part. 

The procedure is illustrated in the commutative diagram 
([I]) drawn in the introduction: the upper line represents 
the MAP steps which are realised through the alternative 
route described in the lower part. Thus M n = H o T n 
and P n := S n o P. 

Theorem V.2. The sequence of MAP maps 

L* n := (M n := H o T n , P n := S n o P) 

is locally asymptotically minmax. The minmax risk at 
fo is equal to the benchmark for the MAP problem of a 
displaced thermal equilibrium state with parameter s = 
(1 — ro)/(l + ro) (see Theorem IV. 1) 



minmax 



{r )=R{r ,L*) = R, 



•mm max 



Proof. The idea is that the spins problem can be trans- 
ferred the Gaussian one with vanishing difference in the 
risks. We first show that i?max(fo,£*) < ^minmax(s) and 
then argue that a strict inequality would be in contradic- 
tion with the optimality of i? m inmax(s). 



By Lemma 2.1 in [20] the first step of the procedure has 
o(l) failure probability, in which case the output state 
may be very different from the desired one. However 
since the norm distance between states is bounded by 
2, an application of the triangle inequality shows that 
this has no influence on the asymptotic risk. Thus we 
can consider that f is in the local neighbourhood ||f — 
fo|| < n _1 / 2+e anc [ we can apply the LAN machinery of 
Section HIB 

The (average) output state is 



(jj r 



:= E(w(X n )) = S n (E [\a,)(a,\] ® N a 



where the expectation on the right side is over the hetero- 
dyne measurement outcomes. Since measurements and 
preparations are contractive we have 

||E [laaXaaG-dall! < \\TM-*a®N a \\i = o(l) (12) 

where <l>^ is the displaced thermal equilibrium state of 
variance l + l/2ro obtained by heterdyning <I>^, preparing 
the coherent state \a^)(a^\ and averaging over (^l,^)- 
On the other hand, by using contr activity of S n 

\\Pa-"n\\i 

< M - S„(*u ® ^3) 111 + - E [|a 6 )K|] |U 
= ||# a -E[|a a ><a s |]|| 1 + (l). (13) 



From (|12|) and (13) we get 



i?max(r ,L*) < ||$ ~ni = R 



minmax 



(s). 



As shown in Theorem IV. 1[ the right side is the MAP 
benchmark for mixed Gaussian states with optimal pro- 
cedure consisting of heterodyne measurement followed by 
the preparation of a coherent state with mean equal to 
the outcome of the measurement. 

Now we show that no MAP strategy can achieve a 
lower asymptotic risk at fo than i? m inmax(s). Indeed sup- 
pose that L := (M n ,P n ) is a sequence of procedures for 
spins which satisfies i? m ax(fo?£) < ^minmax (5). Then as 
shown below, we could construct a MAP procedure for 
which is strictly better that the optimal one, which 
is impossible. 

Let 5 > be a small number to be fixed later. We mix 
with the thermal state of the same temperature ^>o, 
through a beamsplitter with small reflectivity r = S and 
transmissivity t = y/l — S 2 and obtain the output 

By heterodyning ^> r ^ we obtain an estimator uq such that 
P[||tf-iZo|| >L] <e 2 /2 

for some (large) L which increases when (5, e 2 I 0. 

We displace the unmeasured component ^> t n by —tuo 
so that from now on we can assume that \\u\\ < L (with 
an 62 loss of risk). 
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Now we choose n large enough so that L < n e . Ther- 
mal states with such displacements are in the range of ap- 
plicability of the inverse map S n in LAN Theorem |III.4 



We apply the channel S n mapping the Gaussian state to 
an i.i.d. spins state, with the small difference that the 
classical parameter is now fixed to zero, i.e. the spins will 
be prepared in a state with Bloch vector of length tq. By 
Theorem III. 4 we have uniformly in \\u\\ < L 



\\S n ($ ta ®N )-pUi=o(l). 

Next we measure and re-prepare the spins using the 
procedure L and obtain a state uo n := E [cj(X n )] such 
that 

IK - P^ll 1 < #max(r , L) + 0(1). 

Finally, we apply the map T n to the output state 
oj(X n ) and keep only the quantum part T^ q \uj n ). By 
the same contractivity argument as before we have 

H^-t^K)!!! < ||$ t3 -T(")(^)l|i + IK-^lli 

< R max 

(f ,L)+o(l). (14) 

At this point we can directly compare out state 
Tn Q \oj n ) with the target <£^, or use a quantum ampli- 
fier to make up for the loss in amplitude induced by the 
initial use of a beam splitter. We follow the second 'un- 
biased' line. The amplifier can be described by a linear 
transformation on the mode a of the oscillator together 
with an ancillary mode b prepared in the vacuum. The 
output modes are 



ci := 
C2 := 

Let A denote the channel mapping the state of the mode 
a to that of the amplified mode c\. Then 

A^tu = % 

where is a thermal equilibrium state with variance 
V = t~ 2 /2r + (r 1 - l)/2. 

The final distance estimate (conditional on successful 
localisation of is 

||* u - a o r,<*>K)||i < ||$ 3 - + ||* t3 - r n (w n )||i 

< ||$-$'||i+i? max (ro,i) + o(l). 

(15) 




Since we assumed that i? ma x(^b? L) < ^minmaxl^), it is 
enough to choose €2 and ||$ — $'||i small enough to obtain 
a contradiction with the optimality of i? m inmax(<s). This 
can be done by choosing S, 62 small enough and n large 
enough. 

□ 



VI. DISCUSSION AND CONCLUDING 
REMARKS 

The problem of finding an optimal MAP reconstruc- 
tion scheme for a family of quantum states has attracted 
significant attention due to its relevance in establishing 
fidelity benchmarks for teleportation and state storage. 
In the case of a family of displaced thermal (and coher- 
ent) states with unknown displacement the problem was 
solved in [6j [9] and the optimal procedure is the hetero- 
dyne measurement followed by the preparation of a co- 
herent state. We showed that the same MAP procedure 
is again optimal for this family of states, with a more nat- 
ural figure of merit - the trace norm distance. Moreover, 
in the case of i.i.d. mixed qubit states, the benchmark 
problem can be solved asymptotically by mapping it to 
the previous problem, using LAN theory. 

Interestingly, the same heterodyne measurement is also 
optimal from the point of view of state estimation [36] . 
On the other hand, in [10] it was shown that for a par- 
ticular family of states in C 2 , the optimal MAP scheme 
involves a measurement which is different from the opti- 
mal one for state estimation. One of our motivations was 
to see whether this peculiarity survives in the asymptotic 
limit, and the conclusion is that for large n the estimation 
and benchmarking can be performed optimally simulta- 
neously, i.e. their measurement parts are identical. 

Another difference between benchmarking and estima- 
tion pointed out in [10] refers to the preparation part of 
the protocols. More exactly, it turns out that in the case 
of a special family of one qubit pure states, the optimal 
re-prepared state does not belong to the family. How- 
ever, our asymptotic benchmark for qubits can be easily 
extended to treat the case of pure rather than mixed 
states and the result is that asymptotically, the optimal 
re-prepared state is in the original model. Thus the ef- 
fect pointed out in [10] is due to the particular geometry 
of the states space in the one sample situation, which 
'linearises' asymptotically. To briefly explain our claim, 
note that asymptotically, the parameter space reduces 
effectively to a small interval and the tangent space ap- 
proximation kicks in. Then the problem is transferred to 
that of finding the benchmark for a family of coherent 
states on a line, where the solution is a homodyne mea- 
surement followed by the preparation of a coherent state 
in the family. 

Oh the other hand, we have seen that in the case of 
mixed states, the re-prepared state is pure, hence not 
in the family, even in the asymptotic framework. How- 
ever this fact is not surprising as it happens already in 
the classical case: if X is a random variable with dis- 
tribution P then the best 're-preparation' is the 'pure 
state' represented by the 5 measure Sx leading to a triv- 
ial benchmark. If P is not a S measure itself then the 
re-prepared 'state' Sx is outside the model. 
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Appendix: Explicit construction of the channels 



We give here a brief review of the main ideas involved 



in the proof of Theorem III. 4 and the description of chan- 
nels T n ,S n . 

On (C 2 )^ we have two commuting unitary group rep- 
resentations 

K n (U) : ® • • • H> f/^i <8> • • • <g) U^ n , 
7r(t) : ^1 ®'-®^n "-^^-1(1) ®i/>t-Hn) 

where U G SU(2) and £ G S'(n), with denoting 
the symmetric group. By Weyl's Theorem, the repre- 
sentation space decomposes into a direct sum of tensor 
products 



n/2 



(c 2 r= e n^m 

3=0,1/2 



(A.1) 



where j is half-integer, = C 2j+1 is an irreducible rep- 
resentation of SU(2), and W n = C nj is the irreducible 
representation of S(n). Since the density matrix p'l is in- 
variant under permutations, it has a block diagonal form 



p n u= ©pSOK*®-^- 



(A.2) 



j=o,i 



This can be interpreted as being given a random vari- 
able J with distribution p^(j) and conditionally on J = 
j, a quantum state p 7 ? The classical and quantum com- 
ponents of the 'data' will be 'processed' by randomising 
J and mapping - through a quantum channel. 

Classical component. — Each block is an eigenspace of 
the total spin operator L 2 = L 2 + L 2 + L 2 Z with eigen- 
value + 1) ~ (2j) 2 . As the big Bloch ball argument 
suggests, the main contribution to L 2 comes from L 2 so 
that the distribution p^(j) can be approximated by the 
binomial 



PcU) 



n 

j + n/2 



l + rV +n/2 



n/2-j 



with r = ro + u z . The fact that p^(j) converges to N$ 
follows now from the classical version of LAN [23] for 



i.i.d. samples of binary variables. We first constructs the 
rescaled variable 

G n '= Vn - r ^j N(u z , 1 - r ), 

but since G n has a discrete probability distribution, we 
need to 'smooth' it by randomising with e.g. a Gaussian 
Markov kernel of variance l/(2y/n) 

K nJ (x) := (n 1 / 4 /^) exp (-yfr(x - G n (j)) 2 ) . 

In this way the convergence in distribution is converted 
into strong (L\) convergence. 

Quantum component. — Conditionally on obtaining 
J = j in the which-block measurement, we are left with 
a quantum state p™- on Hj. The action of T n will be to 
imbed it into the quantum oscillator space by the isom- 
etry Vj : Uj -> £ 2 (Z) define below. 

Let TTj denote the irreducible representation of SU(2) 
on T~Lj and denote its generators by Lj yCL = 7Tj(cr a ). The 
space has an orthonormal basis m), m = — 2j, . . . , 2j} 
such that 

L jA^ m ) rn\j,m). 



The Quantum Central Limit Theorem suggests that 
the properly normalised operators Lj^± := Lj jX ± iLj jV 
converge to the annihilation and creation operators a, a) 
when j « nro/2 oo. Since they act as ladder operators 



J j,+\ji m ) = Vj - m Vj + ?™ + i |j,m + 1), 

Jj-\j,m) y/j - m + li/j + m | j, m - 1). 
it is natural to define the isometry 

: |m, j) ^ \j - m) 
defining the embedding (channel) 

Putting everything together we obtain the channel T n 



which implements the convergence to Gaussian in Theo- 
rem nLii 

The channel S n is basically an inverse of T n : the nor- 
mal distribution is discretised to produce the distribution 
and the quantum Gaussian state is compressed to the 
first 2j + 1 levels and mapped to Hj with the co-isometry 
V}. For more details on the proof we refer to [20] . 
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